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ABSTRACT 

We study the statistical properties of magnification perturbations by substructures in strong lensed systems 
using linear perturbation theory and an analytical substructure model including tidal truncation and a continuous 
substructure mass spectrum. We demonstrate that magnification perturbations are dominated by perturbers found 
within roughly a tidal radius of an image, and that sizable magnification perturbations may arise from small, 
coherent contributions from several substructures within the lens halo. The root-mean-square (rms) fluctuation of 
the magnification perturbation is ~ 10% to ~ 20% and both the average and rms perturbations are sensitive to the 
mass spectrum and density profile of the perturbers. Interestingly, we find that relative to a smooth model of the 
same mass, the average magnification in clumpy models is lower (higher) than that in smooth models for positive 
(negative) parity images. This is opposite from what is observed if one assumes that the image magnification 
predicted by the best-fit smooth model of a lens is a good proxy for what the observed magnification would have 
been if substructures were absent. While it is possible for this discrepancy to be resolved via nonlinear perturbers, 
we argue that a more likely explanation is that the assumption that the best-fit lens model is a good proxy for the 
magnification in the absence of substructure is not correct. We conclude that a better theoretical understanding 
of the predicted statistical properties of magnification perturbations by CDM substructure is needed in order to 
affirm that CDM substructures have been unambiguously detected. 

Subject headings: cosmology: theory - dark matter - galaxies: formation, halos, structure 



1. INTRODUCTION 
1.1. Evidence for Small-Scale Features in Strong Lenses 

Standard lens models often have difficulties explaining the 
relative fluxes of multiply-imaged sources (Metcalf & Zhao 
2002). Perhaps the simplest explanation for these discrepan- 
cies is that they are electromagnetic in origin. Observed fluxes 
may be affected by obscuration due to dust, scintillation in 
the galaxy, or some other form of electromagnetic phenomena 
(e.g., Koopmans 2003). However, there is strong evidence that 
these are not the only source of the discrepancies. In particular, 
Kochanek & Dalai (2004) have shown that positive-parity im- 
ages in observed lenses tend to be brighter than what the stan- 
dard lens models predict, while negative-parity images tend to 
be demagnified. Electromagnetic effects do not distinguish be- 
tween positive-parity and negative-parity images, so we are left 
to conclude that flux anomalies must be produced by gravity. 
That is, the lensing potentials of real galaxies are not fully char- 
acterized by the simple models used to describe them. 

The next simplest explanation for these discrepancies is that 
standard lens models are either overly simplistic or overly re- 
strictive (Kawano 2004; Evans & Witt 2003, though see also 
Yoo et al. 2005). For instance, environmental effects are often 
modeled as a constant, external shear, whereas detailed model- 
ing of the lens environment might be necessary (Mtiller 2002; 
Keeton & Zabludoff 2004). Again, there is strong evidence that 
this is not the only difficulty. 

The first piece of evidence that other difficulties must exist 
concerns the so-called cusp relation. Cusp lens configurations 
occur when a source is located near one of the cusps of a lens' 
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tangential caustic. They characteristically have a tight cluster 
of three images and a significantly more isolated fourth image. 5 
The cusp relation relates the signed fluxes T[, of each of the im- 
ages (labeled by index i), where the sign of the flux is given by 
the parity of the image. For cusp configurations, one can define 
the cusp parameter R = li/Q^i I^D' where the sum is over 
the three tightly-clustered images. For a point source located 
near the cusp, one expects to find R m so long as the lensing 
potential is featureless on scales comparable to or smaller than 
the typical image separations (Schneider & Weiss 1992; Za- 
kharov 1995; Gaudi & Petters 2002). Keeton et al. (2003) have 
shown that careful analysis can identify lenses where the con- 
dition R ss is violated, providing strong evidence that small- 
scale features in the lensing potential are present. A similar ar- 
gument can be made for fold lenses where one expects a close 
pair of images with equal and opposite fluxes in addition to two 
isolated images. This fold relation is likewise violated in a large 
number of lenses (Keeton et al. 2005). Note that while such de- 
tections are model independent and robust, they require further 
model-dependent analysis to determine possible causes of the 
perturbation and to identify which image (or images) is (are) 
perturbed (e.g., Mao & Schneider 1998; Keeton 2001; Dobler 
& Keeton 2005). 

A second piece of evidence for small-scale structure in lenses 
is the frequency dependence of the flux ratios of multiply- 
imaged sources (Moustakas & Metcalf 2003). This argument 
hinges on the fact that features in the lensing potential smaller 
than the source size tend to be smoothed out by the source. 
As an illustration, consider two spatially-coincident sources of 
known brightness, one large and one small, lensed by an inter- 
vening lens potential with features on scales intermediate be- 
tween the two source sizes. In such a case, the smaller source 
is strongly affected by these intermediate features, whereas the 
larger source is not because the observed image flux constitutes 

5 A fourth image may be absent in the so-called "naked cusp" lenses, which 
occur when the cusp of a tangential caustic is not contained in the region interior 
to the radial caustic in the plane of the source. 
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an average over a region larger than the scale of the features. 
Therefore, the existence of the features is signaled through dif- 
ferent observed magnifications for each of the two sources. For- 
tunately, Nature has been kind enough to provide exactly this 
kind of setup. In particular, the spatial extent of the emission 
region of a quasar depends on the particular frequency at which 
the quasar is observed (e.g., Kembhavi & Narlikar 1999). By 
comparing flux ratios of multiply-imaged quasars in different 
spectral regions, one may search for structure in the lens poten- 
tial on length scales intermediate between those of the quasar 
emission regions. Again, more detailed modeling is necessary 
to determine the nature of the perturbation (Metcalf 2004). 

1.2. CDM Substructure as a Possible Source for Small-Scale 
Features in the Lensing Potential 

To our knowledge, Mao & Schneider (1998) were the first 
to propose that intermediate-mass-scale substructures within a 
lens could explain the problem posed by anomalous flux ra- 
tios. The Cold Dark Matter (CDM) paradigm of cosmological 
structure formation (e.g., Blumenthal et al. 1984; White & Rees 
1978) predicts just such substructure. In recent years, a new 
generation of numerical simulations of structure formation in 
the CDM paradigm revealed that the dark matter halos that are 
believed to host galaxies generally have ~ 10% of their mass 
in distinct, gravitationally-bound, substructures commonly re- 
ferred to as subhalos or satellite halos (e.g., Klypin et al. 1999; 
Moore et al. 1999; Ghigna et al. 2000; De Lucia et al. 2004; 
Diemand et al. 2004; Gao et al. 2004), and it was quickly real- 
ized that these subhalos are an ideal candidate for the source of 
the lensing perturbations (see Moore et al. 1999). Conversely, 
estimates of the abundance of substructure can serve as a test 
of models of cosmological structure formation (e.g., Dalai & 
Kochanek 2002b; Zentner & Bullock 2003). 

The amount of substructure predicted by the CDM model 
has been the subject of numerous recent studies. One apparent 
discrepancy between theory and observation is that there are 
more than an order of magnitude fewer dwarf satellite galax- 
ies about the Milky Way and M3 1 than the number of subha- 
los of comparable velocity dispersion predicted by the CDM 
paradigm (Kauffmann et al. 1993; Klypin et al. 1999; Moore 
et al. 1999). This mismatch is known as the "missing satellites 
problem," and many possible resolutions have been considered. 
Several authors have proposed modifications to the properties 
of the dark matter, making it "warm" rather than cold (Hogan 
& Dalcanton 2000; Colin et al. 2000; Bode et al. 2001 ; Lin et al. 
2001 ; Knebe et al. 2002) or introducing a self interaction for the 
dark matter (Spergel & Steinhardt 2000) in order to reduce the 
amount of substructure. 

Alternatively, any suppression in the amount of small-scale 
power in the primordial spectrum of density fluctuations can 
significantly influence halo structure (Zentner & Bullock 2002; 
McGaugh et al. 2003; van den Bosch et al. 2003) and may re- 
duce the abundance of subhalos (e.g., Kamionkowski & Liddle 
2000; Sigurdson & Kamionkowski 2004). Zentner & Bullock 
(2003) studied in detail the dependence of halo substructure on 
the primordial power spectrum and showed that even mild mod- 
ifications to the power spectrum on small scales can greatly 
affect the severity and interpretation of the missing satellites 
problem. These authors also estimated projected subhalo mass 
fractions for several different models, demonstrating how mea- 
surements of this quantity through lensing flux anomalies (e.g., 
Dalai & Kochanek 2002a) could provide information about the 



nature of the dark matter and the primordial power spectrum, 
and thus inflation, on scales that may be otherwise inaccessi- 
ble. 

The most conservative solution to the missing satellites prob- 
lem is that star formation may be naturally suppressed in a 
large fraction of small halos due to feedback from supernovae 
(e.g., Dekel & Silk 1986; Kauffmann et al. 1993; Cole et al. 
1994; Somerville & Primack 1999) or from the ionizing back- 
ground (e.g., Rees 1986; Efstathiou 1992; Kauffmann et al. 
1993; Shapiro et al. 1994; Thoul & Weinberg 1996; Bullock 
et al. 2000; Somerville 2002; Benson et al. 2002). This leads to 
two possible scenarios. The feedback mechanism could cause 
an abrupt drop in the efficiency of galaxy formation in small 
halos, in which case the MW satellites should lie in the eleven 
most massive subhalos surrounding the MW. This could be true 
if the mapping between observed stellar velocity dispersions 
of the satellites and the size of the subhalos that host these 
satellites allows for the satellites to sit in large subhalos due to 
the significant tidal evolution of the subhalos (e.g. Stoehr et al. 
2002; Hayashi et al. 2003, though recent work by Kazantzidis 
et al. 2004b does not support this scenario). Contrarily, feed- 
back could set in gradually so that small subhalos become in- 
creasingly less likely to host luminous satellites. Including sev- 
eral other details, this type of scenario leads to a model of bi- 
ased galaxy formation in small halos that appears to agree well 
with many observed features of the MW dwarfs (Kravtsov et al. 
2004; Zentner et al. 2005c). 

If inefficient galaxy formation in small halos is the resolution 
to the missing satellites problem, galaxy-sized subhalos should 
be filled with massive, dark subhalos that would be devoid of 
stars and detectable only through their gravitational influence 
or, more speculatively, through the detection of gamma-rays 
(e.g., Silk & Stebbins 1993; Berezinsky et al. 1997; Bergstrom 
et al. 1999; Calcaneo-Roldan & Moore 2000; Baltz et al. 2000; 
Tasitsiomi & Olinto 2002; Stoehr et al. 2003; Koushiappas et al. 
2004) or antiprotons (e.g., Bergstrom et al. 1999; Bottino et al. 
1998; Donato et al. 2004) from dark matter annihilations in 
their dense cores (for further details see Bertone et al. (2005) 
and references therein). An unequivocal detection of subhalos 
through lensing could be used to constrain the properties of the 
dark matter and the primordial power spectrum on small scales 
(Dalai & Kochanek 2002b; Zentner et al. 2005b). A detection 
at sufficiently high levels could represent an enormous triumph 
for the CDM paradigm of structure formation with a standard, 
scale-invariant primordial power spectrum. Such a detection 
could help determine the nature of the feedback mechanism that 
leads to the dearth of small, luminous satellites in the MW and 
distinguish between the proposal of Stoehr et al. (2002) and 
Hayashi et al. (2003) and that of Kravtsov et al. (2004). 

It is important to note that, while dark matter substructure 
within the lens halo seems to be the leading interpretation of 
observed flux anomalies, it is certainly not the only possibility. 
Other possible sources of small-scale fluctuations in lens po- 
tentials include stars, correlated and uncorrelated external ha- 
los along the line of sight (Chen et al. 2003b; Moller & Blain 
2001; Metcalf 2004a,b; Wambsganss et al. 2004), and possible 
disk structures within the galaxy (Quadri et al. 2003; Moller 
et al. 2003; Bradac et al. 2004; Amara et al. 2004). Microlens- 
ing by stars in the lens galaxy, while expected and observed in 
some cases (e.g., Schild 1996; Refsdal 2000; Wozniak 2000; 
Wisotzki 2003; Schechter 2003; Richards 2004), should not af- 
fect the radio or narrow-line emission fluxes due to the source 
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size (Moustakas & Metcalf 2003; Kochanek & Dalai 2004; 
Dobler & Keeton 2005), and cannot be the cause of radio flux 
anomalies. 

In light of this discussion, predicting the properties of magni- 
fication perturbations due to CDM substructures in realistic lens 
systems is of paramount importance. Unfortunately, the de- 
tailed CDM predictions for lens systems are not yet completely 
understood. The first issue to address is whether or not the 
amount of substructure predicted by the standard CDM model 
is consistent with observed flux perturbations. This problem 
has been addressed analytically (Chen et al. 2003b), numeri- 
cally (Metcalf & Madau 2001), and observationally (Dalai & 
Kochanek 2002b; Metcalf & Zhao 2002; Chiba 2002; Bradac 
2002) with fairly consistent results: a projected substructure 
mass fraction, / su b = £ JM i/£ fof , of order a few percent near the 
images can reproduce the flux perturbations in observed gravi- 
tational lenses. Whether this constitutes agreement with CDM 
or not is unclear. While the total mass fraction in substruc- 
ture for CDM halos is ~ 10%, naive theoretical estimates of 
the projected subhalo mass fraction near typical Einstein radii 
of lenses (~ a few percent of the halo virial radius) are signif- 
icantly lower, yielding / su b < 0.5% (Zentner & Bullock 2003; 
Mao et al. 2004). However, it is not clear whether such a naive 
comparison is fair (Chen & Rozo 2005, in preparation) because 
massive substructures could have important effects even if they 
do not fall within the Einstein radius of the host galaxy. Fur- 
ther, spatially-biased substructure distributions will clearly af- 
fect the lensing potential, and to what extent these effects can 
be subsumed within the lens model remains unknown. In re- 
lated investigations, Bradac et al. (2004) have argued that the 
predicted amount of substructure in lenses is sufficient to ex- 
plain the level of cusp relation violations observed in the data 
using simulations of galaxy formation in CDM, though Amara 
et al. (2004) (see also Maccio et al. 2005) reached the opposite 
conclusion. In summary, strong lensing is an important probe 
of small-scale structure and the question of whether magnifi- 
cation perturbations are observed at the level that is expected 
from cold dark matter substructures has not been answered un- 
ambiguously. 

1.3. This Work 

In the present work, we quantify the magnifications perturba- 
tions predicted by the CDM model of structure formation and 
develop an understanding of the important aspects to consider 
highlighting several key parameters. In particular, we predict 
the statistical properties of magnification perturbations due to 
subhalos in realistic lens models. We begin in § 2 by gen- 
eralizing the work of Keeton (2003). In particular, using lin- 
ear perturbation theory we derive a general expression for the 
cross section of a single perturber to produce a magnification 
perturbation above some given size assuming only circularly- 
symmetric and monotonically-decreasing substructure surface 
density profiles. Appendix B and Appendix C illustrate our 
formalism in the context of point mass and Singular Isother- 
mal Sphere (SIS) perturbers. In this context, we highlight the 
importance of tidal truncation on the probability of finding non- 
linear perturbers by showing that the expected number of non- 
linear perturbers increases by an order of magnitude if the tidal 
truncation of substructure is ignored. 

In § 3 we build on these results to derive expressions for 
the average and variance of the total magnification perturba- 
tion due to an ensemble of perturbers (not necessarily identical). 



We then apply our results to the specific problem of magnifica- 
tion perturbations by CDM substructures within lens galaxies in 
§ 4. We use a simple, yet realistic, model for halo substructure 
which includes tidal truncation and a substructure mass spec- 
trum, and thereby predict the average and variance of the mag- 
nification perturbations generated by CDM substructures. This 
type of argument provides an important statistical test of the 
CDM paradigm, which is particularly relevant given that the 
level of substructure near the Einstein radius of CDM halos is 
expected to vary enormously from halo to halo (e.g., Zentner 
& Bullock 2003; Mao et al. 2004; Taylor & Babul 2005; Zent- 
ner et al. 2005a). Furthermore, our analytical treatment allows 
us to identify and isolate the various characteristic of substruc- 
tures to which the statistical properties of magnification pertur- 
bations are sensitive to, such as substructure profiles and their 
mass spectrum. 

We find that small, negative values for the average magni- 
fication perturbation are a generic prediction. In other words, 
positive-parity images generated by a lens with substructure are 
dimmer than the corresponding images for a lens of the same 
mass with no substructure. We show that this can be understood 
as a byproduct of requiring perturbations to be linear: introduc- 
tion of a nonlinear cutoff implies that a small fraction of the 
mass is ignored, which leads to a small negative magnification 
perturbation. The dimming of positive parity images is oppo- 
site to what is observed ;/ one assumes that the best-fit model 
for a lens is a good proxy for the lensing potential obtained by 
replacing all substructures by a smooth component, which we 
argue is likely not a good assumption. In § 5 we summarize 
our results and draw conclusions. Lastly, in Appendix A, we 
discuss in detail the validity of linear perturbation theory, and 
argue that linear perturbation theory should be valid whenever 
astrometric perturbations are negligible. Finally Appendix B 
and Appendix C illustrate our algorithm for computing mag- 
nification perturbation cross sections explicitly in the cases of 
point mass and singular isothermal sphere perturbers respec- 
tively. 



2. MAGNIFICATION PERTURBATION CROSS SECTIONS 

In this section, we illustrate a general algorithm for comput- 
ing the cross sections <t(5) for individual perturbers to generate 
magnification perturbations of size <5 = 5///|/z| or larger (fi is 
the unperturbed image magnification). Note that for a constant 
surface number density of perturbers s, the probability of find- 
ing a perturber that creates a perturbation of size S is simply 
dP = s\d<r/d8\d5. This forms the basis of our analysis of multi- 
ple perturbers and underlines the importance of understanding 
the magnification perturbation cross section ct(<5). 

Consider the image of a point source lensed by a smooth, pro- 
jected lensing potential ip, which we refer to as the macrolens 
or macromodel, and let k and 7 be the values of the conver- 
gence and shear fields at the image position. We investigate the 
magnification perturbation of the image when one introduces a 
small perturbation to the potential, Sip. In general, the pertur- 
bation Sip will not only change the flux of the image, but also 
its position. However, here we will work in the limit that as- 
trometric perturbations are negligible. We emphasize that neg- 
ligible astrometric perturbations means that the change in the 
convergence and shear values of both the macrolens and the 
perturber have to be negligible. As we show in Appendix A, 
we expect this assumption to hold for small flux perturbations, 
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that is, when 



/' 



< 1, 



(1) 



where the flux perturbation 5/j, is evaluated at the unperturbed 
image position. Consequently, we will only be considering 
small perturbations to the flux. 

Let Sip, 5k, and -<57(cos(20),sin(20)) be the profiles of the 
potential, convergence, and the two components of the shear of 
an individual perturber. We assume that all profiles vary mono- 
tonically. The Poisson equation for the potential ip is linear, 
so the total convergence and shear are given simply by k + 5k 
and (7— *7Cos(2</>),— *7sin(20)). Linearizing the magnification 
/jT l = (1 - ft) 2 - 7 2 , results in 6 

5(6,4,) = ^ = 2 1 /i| [(1 - ft)«5ft(0)- 7 <57(0)cos(2<«] , (2) 

where ft and 7 are the convergence and shear of the macro- 
model respectively. We have taken the origin to be at the center 
of the perturber, and the coordinates 9 and <f> represent two- 
dimensional polar coordinates. In what follows, it is more con- 
venient to choose the image (assumed fixed) as the origin of the 
coordinate system. In this case, the angle 9 is still the separation 
between the image and the perturber, but <fi becomes the angle 
of the perturber's position. One may check that the above ex- 
pression remains valid. The perturbation 5 as a function of per- 
turber position is illustrated in figure 1 . Using Equation (2), we 
define the cross section for magnification perturbations stronger 
than A > as 



a(A) = 



<5>A 



= (9d9d<j))H(5-A), 



(3) 



where H(x) is the Heaviside step function. Of course, if one 
is interested in perturbations 5 < 0, one ought to integrate over 
the region 5 < A instead. 7 To carry out the integral, one might 
fix the angle <fr, find the radial region over which perturbations 
stronger than A exist, and then integrate over all angles. Un- 
fortunately, this is not possible in general because the equation 
5(9, <fi) = A defines the ^-coordinate boundary implicitly. A bet- 
ter approach is to first find the range of radii 9 £ [9 m i n ,9 m . dx ], 
over which perturbations of size A or stronger are possible. 
Next, for all radii within this interval, find the angular region 
over which perturbers produce perturbations stronger than A. 
Then lastly, integrate this angular region over all radii to ob- 
tain the cross section. The advantage here is that the simple <j> 
dependence of the perturbation 5(9, <f>) allows one to solve ex- 
plicitly for the relevant angular intervals. 

Consider Equation (2). As 9 — ► 00, we have 5 — > 0, so it fol- 
lows that if perturbations ever get stronger than A, there must 
be a maximum distance f? max (A) at which 5 = A first occurs. 
The quantity 5(9, 0) is always bounded by 2|/z|((l — k)Sk± 
7*7), so r? max (A) is given by the solution to 

2\fl\ [(1 - ft)<5ft(#max) + 7<h(#max)] if A > 

2|//| [(1 - ft)<5ft(0 max )-7«5 7 (^ inax )] if A < 

(4) 

If perturbations stronger than A are never produced, we set 
#max(A) = 0. It is important to note that even though both 5k 

6 The negative sign in front of 7 in Equation (2) is due to our sign convention. 
One has 7 = — 7(cos(2<^),sin(2<j?))), so aligning the shear with the x axis can 
mean either <j> = for 7 = (~7,0) or <f> = 7r/2 for 7 = (7,0). We use the latter. 

7 We rely on context to specify whether the phrase "perturbations stronger than 
A" means \S\ > A regardless of the sign of 8, or 8 < A for negative A and 

8 > A for positive A. For instance, in this section we will be mostly interested 
in the latter case, but when computing the number of nonlinear perturbers, one 
is interested in perturbers with |<5| > A fa 1 regardless of the sign of 8. 



5 (A): 




FIG. 1 . — Shown above are the 5 contours around a positive parity image. 
The field <5(x) is defined as the size of the magnification perturbation experi- 
enced by an image at the origin when a perturber is placed at point x. The 
shaded contours are obtained assuming truncated SIS perturbers, with dark 
being negative and light being positive. The boundaries extending out at 45° 
angles in the shaded contours correspond to 8 = (no perturbation). Distances 
are measured in units of the Einstein radius and the tidal radius is assumed to 
be ten times the Einstein radius, clearly seen as a discontinuity in the shaded 
contours. The angle <j> is defined relative to the x axis, itself defined such that 
the macroshear is 7 = (7, 0). The solid contours are obtained using a Pseudo- 
Gaffe profile, and correspond to 5 = {-0.01,0.0,0.01,0.05,0.10,0.50}. The 
double lobe features of SIS perturbers found by Keeton (2003) are evident at 
distances smaller than the tidal radius. 



and *7 are monotonic by assumption, 5(9, (j>) need not be. If 
this is the case Equation (4), may have two roots: max (A) is de- 
fined as the larger of the two roots, while the second root is the 
minimum radius # m j n (A), for which perturbations stronger than 
A exist. If Eq. (4) has only one or no roots, we set 6> m j n (A) = 0. 
One might expect perturbations always to become stronger as 
the perturber approaches the image (i.e. that 5 varies mono- 
tonically), but this is not necessarily so. As a counterexample, 
point masses may create negative perturbations on a positive- 
parity image, while singular isothermal sphere (SIS) perturbers 
cannot. Consider an SIS truncated at some finite radius and 
placed at infinity so as to produce a negative perturbation. As 9 
decreases, the perturbation grows more negative, until the dis- 
tance from the image is equal to the truncation radius. At that 
point, the perturbation becomes positive. A truncated SIS has a 
minimum radius for which negative perturbations are possible. 

Let us find the angular region over which perturbations 
stronger than A exist for a fixed radius 9. The boundary points 
are defined by 5(9, 4>) = A, with a solution for <j> £ [0, ir/2] given 
by 8 



n (6»|A)=^cos- 



1 



(1-k)5k(9) 



(5) 



A 

2R 

exists in the region 



}■ 



Of course, a second solution, <j> = it - 
4> £ [tt/2,tt]. In principle, one is also interested in solutions 
with <j> £ [tt,2tt], but perturbations are symmetric under reflec- 
tion about the axis <j> = 0, so the total cross section is double that 
obtained by integrating Eq. (3) over the region <f> £ [0,7r]. Fi- 
nally, we can check that 5 always increases with <fi from 4>=<j) m 
while 5 decreases with <f> from cf> = ir— <j> m . The relevant angular 
interval for 5 > A with A > is thus <\> £ [</>„,, 7r-0 m ], whereas 

8 It is this step that is not always possible to perform when trying to deter- 
mine the allowed radial region for a fixed angle <j>. However, when possible, 
switching the order of integration may prove simpler (for example, see Keeton 
2003). 
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for A < 0, the appropriate region is <fi £ [0,</> m ] U [7r-0 m ,7r]. 
Physically, this statement says that perturbers are most effec- 
tive at brightening positive-parity images when placed perpen- 
dicular to the shear axis 71 = 7, while the opposite is true of 
negative-parity images. 

One possibility we have not considered is that there is no so- 
lution to 5(9, <fr) = A. For # m ; n < 8 < 8 max , one is guaranteed 
to have perturbations at least as strong as A, so a lack of solu- 
tions implies that at the given radius, perturbers at any angle <fi 
produce perturbations stronger than A. The limit 6(9, <fi) = A 
is never achieved, not because perturbations are not strong 
enough, but because they are always too strong. In such cases, 
the desired angular interval is <fi G [0,27r], so we define 4> m = 
for A > 0, and </> m = tt/2 for A < 0. 

We can now compute the magnification perturbation cross 
section by direct integration according to Eq. (3). This yields 

{ Tr9l ax -2j g 9 -(9d8)2M8\A) if A > 
\ 2f^(0d9)2(t) m (9\A) ifA<0. 
Equation (6) is the result we were seeking. We summarize our 
algorithm for computing cross sections as follows. 

1 . First, given the convergence and shear profiles of a per- 
turber, use Eq. (4) to find the maximum and minimum 
radii (6* m ; n and 6* max ) such that perturbations 5 stronger 
than A exist. 

2. Second, define the function <j> m (9 | A) via equation (5). 

3. Third, use Eq. (6) to compute the total magnification per- 
turbation cross section. 

In appendices B and C we illustrate this formalism using point 
mass and singular isothermal sphere profiles as simple exam- 
ples. This analysis reproduces the previous results of Mao & 
Schneider (1998) and Keeton (2003) in these two cases respec- 
tively. 

3. THE EFFECTS OF MULTIPLE PERTURBERS: THE AVERAGE 
PERTURBATION AND ITS VARIANCE 

In the previous section, we studied the effect of a single per- 
turber on an image and computed the cross sections for pertur- 
bations of a given amplitude. In this section, we extend these 
results and consider magnification perturbations due to an en- 
semble of perturbers. In particular, we compute the average 
perturbation (5j) and its variance in the limit that large mag- 
nification perturbations are unlikely and assuming that no indi- 
vidual perturber creates a perturbation larger than some linear 
cutoff Anl- 

Consider the magnification perturbation due to an ensemble 
of perturbers. Rather than adding these perturbers artificially 
and thereby increasing the mass of the lens, we assume the per- 
turbers constitute a redistribution of some fraction of the mass 
of the macrolens. This is equivalent to adding substructure 
while at the same time adding a negative mass component that 
traces the macrolens' mass distribution. Thus, we first treat the 
problem of simply adding substructure artificially, and at the 
end we include the effect of mass conservation by adding the 
magnification perturbation due to a hypothetical negative mass 
component. 

In our linear approximation, the total substructure magnifi- 
cation perturbation is the sum of the individual perturbations 



£ t (A0 = 2|m| 



(1 
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■ k) 8ki 



N 

-7^<S7,cos(2(/) I -) 



N 



Si. (7) 



In Eq. (7), Si, <5k,-, and Sji represent the magnification perturba- 
tion, convergence perturbation, and shear perturbation of the ;th 
perturber respectively. This relation should hold provided that 
the position perturbation 5a is negligible. Unlike the case of a 
single perturber, in the case of multiple perturbers, we do not 
have a "rule-of-thumb" condition for when this approximation 
holds. However, we naively expect that the approximation will 
be good when magnification perturbations are small. In partic- 
ular, oppositely-positioned perturbers give no net astrometric 
perturbation, but no analogous symmetry exists for magnifica- 
tion perturbations so long as 5k > (that is, the perturber repre- 
sents an increase in the surface density). Therefore, we expect 
position perturbations to grow no faster than magnification per- 
turbations. Henceforth, we assume Eq. (7) is valid (astrometric 
perturbations are negligible), and expect such an approximation 
to be justified if the net magnification perturbation is small, as 
inEq. (1). 

Given a perturber with a uniform probability of being in 
some finite region of space 1Z, about an image, the probability 
distribution p(5), for a magnification perturbation in the range 
5 to 5+d5 is given by 



p(5)d5 ■■ 



1 



d 2 9 6 D [5(9, 4,) -6]-- 



1 



da 



d5 



dS, (8) 



A(K)J n — — A{n) 

where a(5) is the cross section for the perturber to produce a 
perturbation stronger than 5, A(1Z) is the area of the region 1Z, 
and 5u[x] is the Dirac delta function of x. Though a uniform 
probability distribution for the perturber is unphysical, it is a 
reasonable approximation provided that the region over which 
perturbers can sizeably affect image magnifications is small 
compared to the length scale over which the projected number 
density of substructures varies. 

We compute the various moments of the distribution (8 n ), 
by defining the region 1Z as all points (9, (j>) such that <5 m i n < 
|<5(6>,^>)| < Anl, In other words, we assume that the perturber 
creates a perturbation stronger than some minimum perturba- 
tion Jmin, but weaker than the cut off Anl- Taking expecta- 
tion values, and using the fact that da(S)/dS < for 5 > and 
dcr(8) /d8 > for 5 < to integrate by parts, we obtain 



(<5») = 



1 



A(TZ) 



+ n 



■ A NL y„(A NL ) + Cn^min) 



dS 5"- l y n (5) 



(9) 



where we have defined the function y„(6) = a(5) + (-l)"a(-5). 

Consider an ensemble of identical perturbers characterized 
by a parameter vector p that specifies their projected mass den- 
sity profiles (e.g., mass for point masses, or velocity dispersion 
for SIS perturbers). The expectation value of the total perturba- 
tion is 

00 „ / N \ N 

(S T ) = ]T P(N) / UptfddSi ]T 6i(p) (10) 

AT=0 J V i=l / i=l 

= (N)(S\p), (11) 

where the quantity (S\p) is the expectation value of 5 due to an 
individual perturber with parameters p, and P(N) is the proba- 
bility of finding N perturbers in the region <5 m ; n < |<5| < Anl- A 
similar computation shows that the variance VAR(<5t) = (S T ) - 
(St) 2 is given by 

VAR(,5 T ) = (AO (5 2 |p) + [(N 2 ) - (N) 2 - (N)](5\ V ) 2 

= (N)(5 2 \p), (12) 
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where, similar to the previous equation, (<5 2 |p) is the expecta- 
tion value of 5 2 due to each individual perturber. To derive the 
second equation, we assumed that P(N) is Poisson, such that 
VAR(A0 = (N), with no correlation in the spatial distribution of 
perturbers. 9 

We can extend this discussion to an ensemble of perturbers 
with different properties. Let ds/dp be the number density of 
perturbers with parameters p within some infinitesimal volume 
of parameter space dp. The expected number of perturbers be- 
comes (N) = D^A(TZ)ds(p) I dp, where Z\ is the angular diam- 
eter distance to the lens. Placing this into our expressions for 
the average perturbation and summing over all perturbers (all 
parameters p), we obtain 



(5j 



ds 
dp —D 
dp 



■ A NL yi(A NL ) + <5 

min^l (<5min) 



As 



dS yi (6) 



(13) 



where Anl and 5 m i n are the maximum and minimum individual 
perturbations respectively. The total perturbation is obtained by 
taking the limit <5 m ; n — ► 0. Note that it is possible for the integral 
over 5 to diverge (see SIS perturbers in appendix C). Likewise, 
the variance of the total perturbation is 



VAR(<5 T ) = 



, ds , 
dP d^°- 



■ AnlJ2( Anl) + 5^2(5^) 



+ 2 



Anl 



dS 5y 2 (S) 



(14) 



Including mass conservation (the negative mass component 
mentioned above) is now trivial. If the mass density in sub- 
structures is / su bS c /2, the net perturbation becomes 

(<5T> = (*r)per,-H(l- ^/sub (15) 

where the perturber contribution (#T}pert is given by Equa- 
tion (13) above and we used the fact that the convergence per- 
turbation from the negative mass component is 5k = — / su b/2. 
This represents a constant contribution, so the variance remains 
unchanged. In Appendices B and C, we consider the average 
perturbation (Sj) and its variance VAR(cVt) for the case of point 
mass and SIS perturbers respectively. 

4. MAGNIFICATION PERTURBATIONS BY HALO 
SUBSTRUCTURES 

In this section we apply our formalism to the problem of 
magnification perturbations by substructures within the context 
of a simple, but realistic, model of CDM substructure that in- 
cludes a mass spectrum of perturbers and tidal truncation of the 
substructure profiles. We find that such perturbations are lo- 
cal in that the assumption of a constant surface density of per- 
turbers is valid. We find that while the most massive perturbers 
dominate the total magnification perturbation, the latter has sig- 
nificant contributions from a large range of substructure masses 
and these contributions depend upon the shape of the substruc- 
ture mass spectrum. In what follows, we ignore the negative 
mass component perturbation in § 4.1, § 4.2, and § 4.3, which 
deal with the details of computing the granular contribution to 
the magnification perturbation. We include the negative mass 
contribution in § 4.4, where we illustrate the expectation values 
for the specific models. 

9 Strictly speaking, one would expect the perturbers to be correlated. However, 
if the surface density of perturbers is sufficiently low, such correlations should 
be a small effect. 



4.1. Substructure Perturbations Are Local 

Following the analysis of Dalai & Kochanek (2002a), we 
adopt a simplified model for substructures in which perturbers 
are characterized by so-called pseudo-Jaffe mass density pro- 
files (see Munoz et al. 2001). The three-dimensional density 
distribution is 

r z (r z + a z ) 

where the parameter a is an effective tidal radius that should 
physically be present for all substructures and serves to localize 
the total mass of the perturber. 10 The two-dimensional conver- 
gence and shear profiles for the pseudo-Jaffe model are given 
by 



5k ■ 



and 



b b baff 
^ = 2e + 2i-W{a- 1 



(17) 



(18) 



where b is defined by the combination b/b= 1 + a/b — (1 + 
(a/b) 2 ) x l 2 , the quantity b is the Einstein radius of the profile, 
and £ = \J 6 2 + a 2 . When a — > oo, the profiles approach that 
of an SIS perturber with an Einstein radius b = b, while in the 
limit a — > with ab constant, the perturber corresponds to a 
point mass with Einstein radius b = \/ab. The total mass of the 
perturber is given by m = nab'S c D 2 ^. 

First, consider the average perturbation (5j), for a positive- 
parity image. Perturbers well within a tidal radius from the 
image act as SIS {5 k = 5^ ~ 1/6), so each individual perturba- 
tion 5 scales as 5 ~ 5k ~ 1/6, while the number of perturbers 
per logarithmic interval grows as ~ 2 . As a result, the con- 
tribution to 5j scales as ~ 9 ~ \/5, meaning that it increases 
with decreasing individual perturber strength, until a peak at 
<5 p « \n\(l - K)b/a, which is the typical perturbation from sub- 
structure one tidal radius away from the image. 

At large distances the behavior is slightly more complicated. 
One might naively expect point mass behavior to dominate, but 
the average perturbation due to point masses is zero. A nonzero 
contribution to 5i arises from the small, but nonzero, conver- 
gence 5k ~ I/O 1 (valid for 6 3> a). Therefore, the contribu- 
tion from perturbers a distance 6 away scales as ~ 6 2 5k ~ 1 /6, 
which, for convergence-dominated perturbations, is a ~ J 1 / 3 
scaling. In the limit 5 — ► 0, all perturbations eventually be- 
come shear dominated with the individual perturber strength 
5 ~ 1 /8 2 , so the net contribution to <5 T scales as ~ 1 /6 ~ 5 1 ' 2 . 

We illustrate this behavior explicitly in Figure 2, where we 
assume typical macromodel parameters k = 0.5 and 7 = 0.2. 
For comparison, we also present the case for truncated SIS per- 
turbers. At first glance, the curve for truncated SIS perturbers 
seems puzzling because the curve does not drop off to zero at 
the tidal radii (5 < 5 p ). The reason for this behavior is that we 
have been somewhat loose in our interpretation of y\(5)5. In 

10 We note that while a pseudo-Jaffe profile is not the most accurate treatment 
for the density profiles of dark matter substructures observed in simulations, it 
is qualitatively correct in that the matter density quickly drops beyond a trun- 
cation radius a. Due to its simplicity, and because it allows us to better relate 
to previous work which uses this or related profiles, we have opted to model 
substructures as pseudo-Jaffe profiles. Adiabatic contraction is one motivation 
for using steeper inner profiles, but this is likely to be irrelevant for the majority 
of small, subhalos that do not host luminous galaxies. The formalism that we 
present can be adapted to accommodate more general subhalo density profiles. 
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FIG. 2. — The contribution per logarithmic interval to the total perturbation 
due to perturbers of strength 8. The quantity Syi(S) (see text) must be mul- 
tiplied by a number density to get a perturbation, so we plot the dimension- 
less quantity b~ 2 8yi(8) where b is the perturber' s Einstein radius. All of the 
curves assume a macromodel with k = 0.5 and 7 = 0.2. From top to bottom the 
three solid and dashed lines correspond to perturbers with a/b = 500, 100,20, 
where a is the substructure tidal radius. The solid lines show the relation for 
pseudo-Jaffe profiles [cf. Eq. (16), Eq. (17), Eq. (18)]. The dashed lines are 
truncated SIS profiles. The falling dotted line shows the expected scaling for 
SIS perturbers, 5y\(S) oc 1/5, while the rising dotted line shows the expected 
scaling in the limit 8 — > 0. The broad peak for the pseudo-Jaffe profiles re- 
flects the smooth change from 8y\ oc 8~ l to oc <5'/ 3 . This transitions to oc 8 1 ' 2 
when perturbations become shear dominated (sharp break). The peak scales 
<5 p = \fJ.\(l —K)b/a are marked with short, vertical lines at the top of the Fig- 
ure. 



particular, the full contribution to 5j per logarithmic interval is 
Sp(S) = Sdyi/dS, which is zero in the region 6 < S p . To derive 
Eq. (13), we integrated Sdyi(S)/dS by parts. This halves the 
contribution from the region 5 > <5 P , while introducing a sym- 
metrical contribution in the region to the left of the peak. 

Analogous arguments hold for the variance of the magnifi- 
cation perturbations, VAR(£t). Perturbers within a tidal radius 
contribute ~ 9 2 d\n{9)5 2 ~ dln(S) to the variance, so all per- 
turbers of strength 6 < S p contribute at comparable levels to 
VAR((5t). Perturbers far away act as point masses and con- 
tribute an amount that scales as ~ 6 2 5 2 ~ S, where we used 
6 ~ I/O 2 for point masses. Finally, there is a small transition 
region between the peak and point-mass behavior where per- 
turbations are convergence dominated with 5k ~ 1 /6* 3 , corre- 
sponding to a ~ <5 4//3 scaling. We show the contribution to the 
variance from strength S perturbers in Figure (3). 

The above results have some interesting implications. In par- 
ticular, we have demonstrated that: (1) the average perturbation 
is dominated by perturbers that are about one tidal radius away 
from the image; and (2) the variance of the perturbations weighs 
all perturbers within a tidal radius equally, with perturbers fur- 
ther away contributing relatively little. In other words, magni- 
fication perturbations are effectively "local" to within the typi- 
cal tidal radii of the perturbers. How local is this? Consider a 
perturber a distance d, from the center of the macrolens poten- 
tial. Then d = V r 2 + z 2 , where r is its projected distance onto 
the lens plane and z is the separation along the line of sight. 
Assuming, for simplicity, truncated SIS profiles for both host 
and perturber, we can approximate the tidal radius a(r,z), by 
setting Pperturberia) = Phost(d). We are generally interested in 
the properties of substructures at a projected distance roughly 
equal to the Einstein radius of the macrolens, r = bn- We find 



FIG. 3. — The contributions per logarithmic interval to the variance of mag- 
nification perturbations. The lines are the same as in Figure 2, but the verti- 
cal axis now represents the contribution to the variance due to perturbers of 
strength 8 in the dimensionless form b~ 2 8 2 y2{8). Again, the macromodel has 
k = 0.5 and 7 = 0.2, and the substructure Einstein (£>) and tidal (a) radii satisfy 
a/b = 500, 100,20 from left to right. The dotted line is the expected scaling 
for point masses, <5 2 \'2(<5) oc 5, and the short, vertical lines at the top mark the 
peak scale 5 P = — n)b / a. 

(b/a) 2 = bbnj{b 2 H +z 2 )- Assuming, for simplicity, that substruc- 
tures trace the smooth component, we can obtain an estimate for 
the average value of b/a along the line of sight to the lens by 
integrating along the line-of-sight distance, z- This yields 

(b/a) » -L(2/n)(b/b H y' 2 , (19) 

with A„ = 1 . For comparison, Dalai & Kochanek (2002a) used 
X„ = 2/tt ~ 0.64. We introduced the parameter A fl , as a way 
to change the effective tidal radii in our model. Larger a cor- 
responds to systematically less centrally-concentrated density 
profiles. Kochanek & Dalai (2004) assumed X a = 2/tt in their 
analysis. For b/bH ~ 10~ 3 -10~ 2 (e.g., milliarcsecond per- 
turbers), we obtain a/bn ~ 5%— 15%. This distance is small 
enough that perturbations may be considered local in the sense 
that the two-dimensional surface density of perturbers does not 
vary much on this length scale (cf. Zentner & Bullock 2003; 
Mao et al. 2004; Diemand et al. 2004; Nagai & Kravtsov 2005), 
but large enough that magnification perturbations to nearby 
pairs of images may be correlated, especially if the substructure 
profiles are less centrally concentrated than we have assumed 
(A fl > 1). 

4.2. Dependence of Magnification Perturbations on the 
Substructure Mass Spectrum 

We now investigate the dependence of magnification pertur- 
bations on the mass spectrum of the substructure perturbers. 
Consider Equation (13) for the average perturbation. We ap- 
proximate the mass spectrum as a power law ds /dm oc m a , trun- 
cated at some maximum mass m max , where ds/dm is the num- 
ber density of substructures in the mass range from m to m+dm. 
The slope a has been determined by high-resolution, cosmolog- 
ical A^-body simulations, which generally yield a « — 1.8 ±0.1 
(e.g., Klypin et al. 1999; Moore et al. 1999; De Lucia et al. 
2004; Diemand et al. 2004; Gao et al. 2004). The surface mass 
fraction ff/ su b of perturbers of mass m is (l/2)£ c ff/ su b = dT, s = 
mds, so that the mass fraction varies as c// su b / dm oc m a+l . Here, 
we have chosen to follow Dalai & Kochanek (2002a) and define 
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FIG. 4. — The average magnification perturbation per logarithmic interval in 
substructure mass m. Masses are expressed in units of the smooth lens mass 
Me contained within the Einstein ring, and the most massive substructures 
considered are m max /M£ = 1% (see text). A substructure mass fraction of 
/sub = 5 X 10~ 3 with a power-law spectrum of index a was assumed. The 
macromodel parameters are k = 0.5 and 7 = 0.2. The solid lines are for the 
pseudo-Jaffe profiles, and the dashed are for truncated SIS profiles. From top 
to bottom at right, the value of a is a = —1.7,-1.8, and —1.9. The dotted line 
shows the expected m 2+a scaling for a = —1.7. 



the "mass fraction" / su b relative to half of the critical density, 
/ su b = 2E S /£ C . This choice is motivated by the fact that all im- 
ages in a lens will probe similar radii in the host halo, and thus 
similar substructure densities, whereas the macromodel conver- 
gence may vary more strongly from image to image. Note that 
the quantity mds/dm is precisely the number of perturbers of 
mass m per logarithmic mass interval, so that Eq. (13) can be 
recast as 



(#r 



1 

2 Jo 
1 

~(a + 2)/ su b 
L JO 



dmdf sub 2 f NL , x , x , 
m am J s 



dm 



m \ m mm 



a+2 



dd yi (S) 



(20) 



where we have ignored the contribution from the bound- 
ary terms by taking the limits <5 m j n — > and Anl — * 00, 
and we have normalized the substructure fraction as / su b = 
f dm dfsuh/dm, neglecting a possible low-mass cutoff by 
assuming that a > -2. The mass m max represents the mass scale 
of the most massive substructures contributing to the magnifi- 
cation perturbations. Below, we argue that this is given roughly 
by m mlLX /ME ~ 1%, where Me is the mass of the macrolens 
contained within an Einstein radius in projection. 

With these assumptions we can estimate how the integrand 
in Eq. (20) scales with mass. The perturbation by mass m per- 
turbers is dominated by substructures a tidal radius away from 
the image and scales as ~ a 2 {\/j,\b/a) ~ ab ~ m. Therefore, the 
term in square brackets is roughly constant, and the perturba- 
tion per logarithmic mass interval varies as ~ m 2 ds ~ mdf m \, ~ 
m a+2 or as ^ m o.i-o.3 f or a = _i 7 to - 1 .9. This dependence is 
rather weak. We show the m 2+a scaling explicitly in Figure 4, 
after enforcing a linear cutoff Anl = 1 (replacing the bound- 
ary term that we neglected earlier). We show curves for both 
pseudo-Jaffe and truncated SIS profiles. As is evident in the 



Fig. 4, the results are nearly identical for these two cases. Fi- 
nally, the term in the square brackets in Equation (20) is, to a 
good approximation, a constant function of perturber mass, so 
that we can write 



2 /sub 



dSyi(S) 



(21) 



The error introduced by making this approximation is w 5% 
and is nearly independent of the exact value of m max so long as 
the peak scale 6 P < Anl, where we reiterate that Anl is a linear 
cutoff scale. 

As before, we can construct a similar argument for the vari- 
ance of the magnification perturbations. In this case, all per- 
turbers within a tidal radius contribute equally to the vari- 
ance, so a linear cutoff Anl, needs to be introduced. This 
constant contribution scales as ~ a {bid) 1 ~ b 2 ~ m 4 / 3 , and 
is integrated between the peak scale 5 p and the linear cutoff 
Anl- The contribution per logarithmic mass interval is thus 
~ (mdf sab )m^ 3 ln(A/(Sp) - m Q+7 / 3 ln(A/5 p ). For a = -1.8, this 
is ~ m 8// ' 5 ln(A/(5p). As before, we can use our understanding 
of this behavior to provide a simple estimate of the variance, 



VAR(5 T )«i(a + 2)/ sub 



™' x dm 
111 



2£ C Z> 





2+a+i 



d6 Sy 2 (5) 

- m=rn, 

In (A/6p(m)) 



In(A/i5p(m max )) 



22) 



This approximation is valid to about ps 5% and, once again, it 
breaks down as <5 p approaches Anl = 1 ■ We illustrate these re- 
sults in Figure 5, and we note particularly the flatness of the 
variance per logarithmic mass interval, which is shallower than 
dln[VAR{dt)] / dlnm< 1/2. Note that the expression above dif- 
fers from that of identical mass m max perturbers, demonstrating 
that the statistical properties of magnification perturbations by 
substructures depend on the mass spectrum of the perturbers. 
This is not surprising as such a dependence has been noted in 
the case of microlensing (e.g., Schechter et al. 2004). Finally, 
from Figure 5, we see that SIS and pseudo-Jaffe profiles lead 
to different variances at a fixed mass fraction, implying that 
the variance of the magnification perturbation is sensitive to the 
substructure mass density profile. 

4.3. Truncating the Substructure Mass Spectrum 

It is interesting to consider what the appropriate values for 
the effective minimum and maximum masses, m max and m m j n , 
are. We begin by considering the relatively simpler problem 
of the low-mass cutoff, m m j n . In general, the magnification 
perturbation due to a perturber smaller than the physical ex- 
tent of the source region is small (e.g., Schneider et al. 1992; 
Dobler & Keeton 2005). The average perturbation is domi- 
nated by perturbers a distance ~ a from the source, where a 
is the tidal radius of the substructure, so the point source ap- 
proximation should be valid so long as the characteristic size 
of the source R is smaller than a. The minimum substruc- 
ture mass can therefore be obtained by setting a « R. Taking 
R ~ 10 pc, as appropriate for quasar radio-emission regions, 
we find m m m/M F . > A^ 2 10~ 9 , small enough to neglect for our 
present purposes 1 1 . 

1 1 One could require instead b PS R, which is appropriate in the limit that in- 
dividual perturbations dominate the net magnification perturbation. This gives 
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FIG. 5. — The variance of the magnification perturbation per logai'ithmic 
mass interval. A substructure mass fraction of / su b = 0.5% with a power-law 
spectrum of slope a was assumed. The linear cutoff Anl is set to unity and 
the macromodel parameters are re = 0.5 and 7 = 0.2. The solid lines are for 
the pseudo-Jaffe profile and the value of a is, from top to bottom at right, 
a = —1.7,-1.8, and -1.9. The dashed line is for a truncated SIS profile with 
a = -1.7. 



Estimating the appropriate maximum effective substructure 
mass m max , is a considerably more difficult problem. In the con- 
text of CDM, the most massive substructures within a halo have 
masses that are typically of order m max ~ 10~ 2 M v j r , where M v ; r 
is the virial mass of the halo (e.g., Klypin et al. 1999; Moore 
et al. 1999; Ghigna et al. 2000; De Lucia et al. 2004; Diemand 
et al. 2004; Gao et al. 2004). For a halo with M vir = 10 13 M Q , this 
corresponds to m ~ 1O U M , which is comparable to the mass 
of the host halo contained within a cylinder of radius equal to 
the Einstein radius of the lens. Clearly, if such a substructure 
were to fall near the Einstein radius of the host halo, all images 
would be strongly perturbed and the lens macromodel would 
be significantly altered (see, e.g., Cohn & Kochanek 2004, and 
references therein). In the most naive view, one could imagine 
that such effects are either entirely degenerate with the macro- 
model, or else explicitly including them when the substruc- 
tures can be optically identified. In this naive picture, mag- 
nification perturbations come about only through substructures 
that do not affect the macromodel. Taking the primary macro- 
model property to be the Einstein radius, one can demand that 
'«max <C Me cx b 2 H , so one may expect mmax/ Me = 1% to be a 
reasonable cutoff mass. 

A more pragmatic self-consistency argument can also be 
made. We have assumed in our formalism that we can ne- 
glect astrometric perturbations relative to the magnification per- 
turbations, so we should include only perturbers for which 
the characteristic perturbation scale 6 P is linear. We can then 
define m max by setting 5 p (m max ) = 1, which leads to m max ~ 
X 4 a {Tr/2) 4 \n\- 3 . For A a = 1, = 10, and <5 P = 1, we get 
'«max/A^£ ~ 0.6%, though it is clear that this value depends 
upon the various parameters. However, it is important to note 
that less centrally-concentrated profiles (A fl > 1) allow for sig- 
nificantly larger mass cutoffs, implying that linear theory be- 
comes a better approximation. In what follows, we truncate the 

"imm/^fe ^ A a 10~ 5 . In our model, the number of nonlinear perturbers of mass 
m scales as m a+1 fo 2 ~ m a+7 / 3 ~ m 1 / 2 , implying that the number of nonlinear 
perturbers increases with mass. At most, or order one nonlinear perturbers will 
be present, and these will all be massive. Hence, the appropriate limit for m ra i n 
is that of lineal' perturbers obtained by setting R ss a as above. 



FIG. 6. — The average and rms values for the total magnification perturba- 
tion due to linear perturbers of an image with macromodel magnification fi and 
convergence re. Solid contours give the rms values while dotted contours give 
the average perturbation. Lens and source redshifts of zi = 0.3 and z s = 1 re- 
spectively, were assumed, and a high-mass cutoff m max /M£ = 1%, where Me 
is the mass of the macrolens projected to within an Einstein radius. The tidal 
radii were set by A„ = 4. The substructure mass fraction is / sll b = 0.5%, and 
mass conservation is enforced through a negative mass component of surface 
density — /% = 2E C / S ub. The unshaded region is unphysical as it corresponds to 
7 < 0. The rms scatter is comparable to that shown in Figure 3 of Kochanek 
& Dalai (2004). 



mass spectrum at m max /M£ = 1 %, regardless of the macromodel 
parameters. 

It should be clear from these arguments that the truncation 
scale m max carries with it a some uncertainty. In general, any 
given macromodel should be able to partly, but not fully, com- 
pensate for perturbations due to very massive substructures, in 
which case there will be an additional contribution to the ob- 
served magnification perturbations that is not being included 
by introducing a cutoff m max /M£ = 1%. Estimating the degree 
to which the effect of massive substructures can be absorbed 
within the macromodel is beyond the scope of this work, but 
we explore this in a forthcoming companion paper (Chen & 
Rozo 2005, In preparation). 

4.4. General Predictions 

We can use this formalism to make some general predictions 
regarding magnification perturbations due to halo substructure. 
Figure 6 illustrates predictions for the average and rms value 
of the total magnification perturbation in the case of positive- 
parity images. As an illustrative example, we have assumed a 
source redshift of z s = 1 and a lens redshift of z\ = 0.3. We have 
also assumed a high-mass cutoff ra max /M£ = 1%, a substruc- 
ture mass fraction / su b = 2S S /S C = 0.5%, a tidal radius given by 
A fl = 4, and a typical host Einstein radius of bu = 1". Although 
the average magnification perturbations are very small (< 1%), 
the rms fluctuations are quite large (<5™ s > 10%), and hence 
large magnification perturbations should be common. Our rms 
estimates seem comparable to the scatter in Kochanek & Dalai 
(2004, 5 r T ms sa 10- 15%, compare to their Figure 3), though the 
average perturbation seems markedly different, even at a qual- 
itative level. Particularly, we find small (|(#t)| "C S™ s ), nega- 
tive average perturbations, whereas Kochanek & Dalai (2004) 
report large (|(<5t)| ~ 10% ~ <5™ S X positive average perturba- 
tions. 

How serious is the discrepancy between the model and ob- 
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served lenses and what is it telling us? To address this ques- 
tion, we consider in some more detail our expression for the 
average perturbation. We now show that small negative per- 
turbations are a generic prediction of linear perturbation the- 
ory when the total mass of the lensing potential is held fixed. 
Consider again our expression for the total magnification per- 
turbation, Equation 7. For simplicity, assume that all perturbers 
are identical. Consider the shear contribution to the average 
perturbation. Two perturbers equidistant from an image and lo- 
cated at position angles <f> and 0+71-/2 create equal and opposite 
shears so the net shear contribution to the average perturbations 
is zero, to a first approximation. The only nonzero contribution 
is caused by the convergence perturbations, so we can write 

<<5 T )~2H(1- K )(fc), (23) 

where the expectation value of the convergence perturbation is 
yet to be determined. An argument similar to the one in § 3 
results in 

(&)=VP(JV)JV/ (24) 
N J-r. a \i<-) 

where the region A(1Z) is defined as before (i.e. E 1Z iff 
<5min < 6(0) < Anl) and N is the number of perturbers. We 
assume that P(N) is a Poisson distribution with an expectation 
value (N) = sA(lZ), where s is the mean number density of per- 
turbers. Observe that if the integral over k in the above expres- 
sion were extended over all space, we would recover the mass 
of the perturber. This leads us to define the nonlinear and linear 
mass fractions as 

/nl = / d 2 9 6k(6), (25) 

m J\6\<AnL 

and /l = 1 - /nl respectively, where m is the perturber mass. 
Thus, /nl is the fraction of the perturber mass contained within 
the nonlinear region around an image for a perturber centered 
on the image. Using these definitions, and taking the limit 
6mm 0, Eq. (24) becomes 

(6k) = (N) ^'J^ = «,(1 -/nl), (26) 

where k s = sm/(E c D L ) is just the convergence contribution by 
substructures of mass m. Replacing this contribution in our ex- 
pression for the average perturbation, and including the nega- 
tive mass component — K s , we obtain 

(5t)un=-2|m|(1-k)kjAl- (27) 

Thus, the average perturbation due to linear perturbers is always 
expected to be negative and approaches zero in the limit that 
local, linear perturbation theory holds. Indeed, our choice A a = 
4 for Figure 6 was made to make /vl small so as to minimize 
nonlinear effects. Lowering A a increases the nonlinear fraction 
/ML and hence (6j) becomes more negative (Jul oc 1/ \f\, at 
fixed mass). 

It is important to emphasize that a negative (<5t) implies that 
positive-parity images will be dimmed and negative-parity im- 
ages will be brightened, which is opposite from the trend for 
observed lenses (Kochanek & Dalai 2004), provided one uses 
the predicted macromodel magnification as a proxy for the ob- 
served magnification if the substructures were replaced by a 
smooth component. Because we assumed only that perturba- 
tions are linear and local, the implications are that either real 
lenses are significantly affected by nonlinear perturbers, per- 
turbations are at least partly due to nonlocal perturbers, or the 
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FIG. 7. — The expected number of substructures for various conditions. 
The unshaded region above the thick, solid line is unphysical for posi- 
tive parity images. The shaded regions below are obtained by computing 
the expected number of perturbers that create perturbations stronger than 
<5 p (see Figure 2), and correspond, from top to bottom, to (N\S > 5 p ) g 
[0, 20] , [20, 26] , [26, 29] , [29, 30] , and 30 or more perturbers. All substructures 
parameters are as per section 4.4. We loosely interpret (N\8 > S p ) as the 
number of substructures found within a tidal radius of the image (see text). 
The solid contours are for (jV|<5 > 1), and give the expected number of non- 
linear perturbers, which is seen to by quite small, < 10~ 2 . The dashed con- 
tours are for a more conservative linear cutoff Anl = 20%, for which we find 
{N\8 > 20%) < 0.1. Truncated SIS profiles give results similar to that of the 
pseudo-Jaffe profiles. 



smooth macromodel magnification is a poor proxy for what the 
magnification would be in the absence of substructures. Note 
that these various possibilities are not mutually exclusive. Of 
course, an alternative, and perhaps more contentious, possi- 
bility is simply that substructures within lens galaxies are not 
responsible for the observed magnification perturbations. As- 
suming this last possibility is incorrect, what is the most likely 
explanation for our results? 

Nonlocal perturbations do not appear to be the most reason- 
able resolution. So long as linear perturbation theory holds, our 
naive expectation is that the average convergence perturbations, 
and hence the average magnification perturbation itself, should 
be close to zero. Further, the two-dimensional surface num- 
ber density of substructures predicted for CDM halos is a very 
slowly-varying function of radius out to several times the Ein- 
stein radius of the lens, so a constant perturber density seems 
like a reasonable approximation. Thus, we do not view non- 
local (non-constant substructure surface density) perturbations 
as a viable explanation for the observed discrepancy. However, 
it is worth noting that if nonlocal perturbations play a signifi- 
cant role, naive comparisons of the substructure mass fractions 
derived from observational studies to those in simulations and 
semi-analytic models at radii of order a few kpc, as have been 
performed so far (e.g., Zentner & Bullock 2003; Mao et al. 
2004), do not constitute fair comparisons. This alone warrants 
further study of the effects of nonlocal perturbers. 

What about nonlinear perturbers? In Figure 7, we show 
the expected number of non-linear perturbers (N\6 > Anl), 
for Anl = 1- It is clear that this number is quite small, typ- 
ically (N\6 > 1} < 10~ 2 . Even for a linear cutoff as small 
as Anl = 20%, the number of nonlinear perturbers is (N\6 > 
20%) < 10 _1 . The prevalence of nonlinear perturbers is un- 
likely, unless observational biases preferentially select systems 
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where nonlinear effects are large, in which case more sophisti- 
cated methods must be used to compare theoretical predictions 
to observational data, perhaps using Monte Carlo methods like 
those of Zentner & Bullock (2003) directly to model lens sys- 
tems. While our results depend on our particular choice of pro- 
files, it seems unlikely that they underestimate the number by 
significantly more than an order of magnitude, which is neces- 
sary to make nonlinear perturbers likely. Nevertheless, includ- 
ing many more massive substructures might add significantly to 
the number of nonlinear perturbers, as would adopting signif- 
icantly higher substructure mass fractions, though higher mass 
fractions seem unlikely in the context of CDM. 

Even in such a case, it is difficult to imagine that the num- 
ber of nonlinear perturbers could be greater than a few. We 
would then expect the contribution to the magnification pertur- 
bation by nonlinear perturbers to have a large rms scatter so that 
<5™ s ~ (Sj). Because the scatter from linear perturbers appears 
to be broadly consistent with observed values, we expect it to 
be difficult to explain a nonzero net magnification perturbation 
with nonlinear perturbers without at the same time increasing 
the scatter in the perturbations beyond that of observations. 

If neither nonlocality of the perturbations nor nonlinear per- 
turbers can explain how positive- (negative-) parity images get 
brighter (dimmer) on average, then the only remaining possi- 
bility that is consistent with perturbations caused by CDM sub- 
structure is that for any given lens, the best-fit smooth model 
prediction for the magnification is a poor proxy for what the 
magnification would be if the mass in substructures were redis- 
tributed as a smooth component. We address these problems in 
detail in a forthcoming companion paper (Chen & Rozo 2005, 
In preparation). 

Before proceeding, we emphasize the importance of tidal 
truncation of substructures in the expected number of nonlinear 
perturbers. In particular, if we assume pure SIS perturber pro- 
files, the number of perturbers in the region \ S\ > 20% increases 
by a factor of ~ 10 and becomes of order ~ 1, as was demon- 
strated by Chen et al. (2003b, compare to our Figure B8). This 
reflects the fact that, in our model, trsis(20%) > a 2 for substruc- 
tures of mass m max , implying that substructures become tidally 
truncated within the region Ssjs > 20%. Thus, use of simple 
SIS profiles that are not tidally-truncated may overestimate the 
importance of nonlinear substructures. Moreover, this implies 
that in order to draw robust conclusions from lens systems, the 
effective truncation of substructures must be well understood in 
systems with a variety of different formation histories, a prob- 
lem which is theoretically challenging. 

4.5. Individual Lenses 

Though the model that we presented is simplified and our re- 
sults should generally be interpreted on a statistical basis, it is 
interesting to consider how this analysis can aid in the under- 
standing of individual systems. For instance, if the probability 
of finding a substructure within a tidal radius of an image is 
small, then the average perturbation ought to be a rather poor 
estimator of the most likely perturbation. In particular, the ge- 
ometric weighting 9 2 of substructures a distance 8 away from 
the image is clearly not appropriate in this case. Conversely, 
a large number of perturbers within a tidal radius would al- 
low us to invoke the central limit theorem, in which case the 
probability distribution p(Sj) would be well approximated by 
a gaussian distribution. This raises a natural question. How 
many perturbers are found within a tidal radius or, more pre- 



cisely, how many perturbers are found in the region S > S v l In 
what follows, we refer to these two conditions interchangeably. 
The rigorous condition will always be 5 > <5 P , and we loosely 
interpret this as perturbers found within a tidal radius from the 
image. 

Figure 7 shows that for typical lens systems, the expected 
number of substructures with 5 > S p is of order ~ 20 when all 
substructure parameters are as described in § 4.4. The exact 
number is somewhat sensitive to the cutoff and density profile. 
For instance, at A a = 1 the number of perturbers within a tidal 
radius decreases to a few to several perturbers, and increases 
again to ~ 10 perturbers if one lowers m max by an order of mag- 
nitude. It seems likely then that in most lens systems the total 
magnification perturbation is due to from a few to about twenty 
substructures contributing at approximately comparable levels. 
Thus we expect that actual perturbations may differ substan- 
tially from the average perturbation, implying a large variance 
(see Figure 6) and a non-gaussian probability distribution for 
the magnification perturbations. 

5. DISCUSSION AND CONCLUSIONS 

The most surpising result from our analysis is that at a fixed 
lens mass, introducing a clumpy component tends to demag- 
nify positive-parity images while magnifying negative-parity 
images on average, with the net effect tending towards zero as 
nonlinear perturbers become less important. This does not im- 
ply that the traditional arguments that adding substructure will, 
on average, increase the magnification of positive-parity images 
are wrong. Rather, it emphasizes that the increase in magnifi- 
cation of the observed image is, in part, due to the simple fact 
that mass is being added to the lens. 

If one adopts the predicted magnification from the best-fit 
smooth macromodel to a lens as a proxy for what the magni- 
fication would have been if the substructure were not present 
and the lens were smooth, the observed average perturbation is 
positive, opposite from what we predict. For low substructure 
mass fractions (comparable to the level predicted by the CDM 
paradigm of structure formation), nonlinear perturbers are un- 
likely and hence nonlinearities, which were disregarded in our 
study, are not a likely explanation for the observed discrepancy. 
Further, if the scatter due to linear perturbers is found to broadly 
match that in observations, it will be difficult for nonlinear per- 
turbers to explain the observed average perturbation without, at 
the same time, significantly increasing the scatter. This sug- 
gests that the use of the best-fit smooth model magnification as 
a proxy for the magnification of a smooth lens may not be jus- 
tified. We investigate this specific question in more detail in a 
follow-up study (Chen & Rozo 2005, in preparation). 

Another surprising aspect of our analysis is the possibility of 
having sizable magnification perturbations arising from small, 
but coherent, contributions from several substructures within 
the lens halo. Such perturbations are difficult to account for 
in individual lenses because knowledge of the expected non- 
gaussian distribution p(Sj) of the perturbations is necessary to 
perform maximum likelihood analyses of particular systems 
(e.g., Mao & Schneider 1998). Likewise, the conclusions of 
analyses that assume a single substructure is responsible for the 
full magnification perturbation (e.g., Keeton 2001; Dobler & 
Keeton 2005) can change when multiple perturbers are present. 
As a simple example, the relation between magnification per- 
turbation cross section and the probability of an image being 
perturbed is weakened if large perturbations can arise from the 
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coherent contribution of multiple, weak perturbations. 

We emphasize that the importance of coherent perturbations 
depends on the tidal radii of substructures. This result un- 
derlines the importance of numerical studies such as those by 
Bradac et al. (2004) and Amara et al. (2004), where tidal lim- 
itation of substructures is inherent in the calculation. Yet the 
theoretical challenges remain significant. The fact that we ex- 
pect low-mass substructures (i.e. substructures with masses be- 
low the resolution limit of present day numerical simulations) 
to contribute to magnification perturbations at a non-negligible 
level and the fact that the scatter in the properties of substruc- 
ture from halo to halo is significant both argue for complement- 
ing direct numerical simulation with analytic (e.g., Chen et al. 
2003a) and semi-analytic computations (e.g., Zentner & Bul- 
lock 2003; Zentner et al. 2005a; Taylor & Babul 2005). Of 
course, the drawback of these studies is that substructure dy- 
namics and tidal truncation are handled only in an approximate 
way. 

One issue that was brought up in our analysis was the defini- 
tion of the substructure mass fraction. In particular, we focused 
on magnification perturbations induced by granularity, so we 
argued that for a host halo with with a mass Me within its Ein- 
stein radius, one is interested in the mass fraction of substruc- 
tures with mass m% m max , where m max w 10~ 2 M£ ~ 10~ 4 M v j r . 
Unfortunately, the huge dynamic range that is necessary to 
probe the properties of halo substructure with A^-body simula- 
tions limits their ability to probe substructures to masses larger 
than m > 10" 5 M vir (e.g., Mao et al. 2004) and to limit such stud- 
ies to a relatively small number of systems. Therefore the CDM 
predictions for the appropriate substructure values require some 
extrapolation and are thus may be robust than one might hope. 
There are other theoretical uncertainties as well. In particular, 
Oguri (2004) pointed out that in systems where the lens galaxy 
is a member of a group or a cluster, the relevant substructure 
mass fraction should include contributions from substructures 
within the group halo itself. Likewise, anisotropy of the host 
halo and its substructure population (e.g., Zentner et al. 2005c; 
Kang et al. 2005; Libeskind et al. 2005) leads to significant sys- 
tematic variations in substructure mass fractions along various 
lines of sight and may alter the expectation value of the sub- 
structure mass fraction due to projection effects (Zentner, Rozo, 
& Kravtsov 2005, In preparation). Finally, the abundance of 
subhalos in CDM cosmologies depends on the amplitude of the 
power spectrum on small scales (Zentner & Bullock 2003), and 
may depend as well on baryonic processes within the host halo 
that are yet to be fully understood. More theoretical work is 
needed to fully characterize the CDM predictions for the sub- 
structure mass fractions relevant for lensing. 

While the detection of dark substructures through image 
splitting or magnification perturbations of extended sources (In- 
oue & Chiba 2003), would provide fairly unambiguous evi- 
dence of CDM substructures, until such detections become fea- 
sible our only way to probe these dark substructures is through 
their effects on multiply-imaged sources. This makes unam- 
biguous detection of dark substructures in individual lenses 
more difficult because more "mundane" causes for the small- 
scale perturbations to the lensing potentials exist. Spectro- 
scopic analyses of lenses may go some way in removing these 
ambiguities (Moustakas & Metcalf 2003), especially if comple- 
mented by studies of the associated astrometric perturbations 
(Chen et al. 2005, in preparation) and by looking for distortions 
in lensed jets (Metcalf & Madau 2001). In light of these ambi- 



guities, we believe that if CDM were to provide concrete pre- 
dictions for the statistical properties of the magnification pertur- 
bations induced by substructures, confirmation of these proper- 
ties in observed lens samples would provide strong evidence in 
support of the CDM paradigm. In this paper, we have identified 
one way (granularity) in which CDM substructures can pro- 
duce flux perturbations, and shown analytically that a proper 
understanding of the substructure profiles, including their tidal 
limitation, and the mass spectrum of subhalos are all neces- 
sary components of a faithful treatment of the magnification 
perturbations and to obtain accurate estimates of substructure 
abundance. Nevertheless, we also expect other effects such 
as perturbations by massive substructures (Cohn & Kochanek 
2004) to play important roles, though the corresponding statis- 
tical properties of these perturbations remain to be understood. 
While a complete of understanding of the CDM predictions for 
the statistical properties of magnification perturbations is a dif- 
ficult task, it is one which is likely to be of vital importance for 
testing the CDM paradigm at small scales and in the highly- 
nonlinear regime. 
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APPENDIX 

A. WHEN CAN ASTROMETRIC PERTURBATIONS BE 
NEGLECTED? 

In general, if the potential of a macrolens is perturbed, the 
image position of a source will change in accordance with the 
perturbed lens equation. We wish to determine under what con- 
ditions is it possible to neglect such perturbations in comparison 
to the magnification perturbations. 

Consider a lens system where the unperturbed image position 
of a point source is 0. Neglecting astrometric perturbations, the 
only change to the convergence and shear sampled by the image 
are the values Sk and 5j of the perturber at the image position. 
Relative to this first-order perturbation, there are higher order 
corrections due to the image being displaced because both the 
macromodel and perturber convergence and shear at the new 
image position will be different. For these higher order terms 
to be negligible, it is required that 

and Sj^^SO, (Al) 

where SO is the position perturbation, and similar inequalities 
must hold for the image shear. To determine when these condi- 
tions are met, one needs to compute the perturbation 50. 

Let a spherically-symmetric perturber be introduced at a po- 
sition 6> p . The perturbed lens equation is 

9 + 30 = f3 + a(0 + S0) + 5a(0 + 50). (A2) 
Sex is the deflection due to the perturber, which is a function 
of the relative position of the perturber + 50-0 p . Linearizing 
a and using the unperturbed lens equation = (3 + a(0), we 
obtain 

M- 1 S0 = Sa(0 + S0), (A3) 
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where M~ l is the inverse magnification tensor due to the macro- 
model, M~ l jj = Sij - da.il 36 j. To get this equation we only 
needed to assume that the macrolens properties vary slowly 
over a distance \60\. To fully linearize the system and set 
5a.{8+56) w Sa(6), the system must satisfy the more stringent 
constraint 

85a. 



\5a\ > 



86 



56 



(A4) 



Assuming we can fully linearize the system and inverting we 
obtain 

50 = MSa. (A5) 

However, there is a self-consistency constraint provided by 
Equation A4 that must be satisfied. Inserting the fully- 
linearized solution into Equation A4, we find that one must 
have 

-MSa. 



\5a\ > 



88 



(A6) 



Consider the matrix 
85a . T _ / Sk+S^/i 



-M = 



hi 



5j2 

Sk—Sji 



1/At 




(A7) 





1/A fl 

7. Typically, one is inter- 



86 

where A« = 1 -K + 7 and \j = 1 — k- 
ested in images that form near the tangential critical curve of 
the macrolens, which satisfy I/A7- ^> 1/Xr. Setting 1/Xr = 
in the above expression leads to 



86a 
~~86 



Mwl/A 7 



^72 



(A8) 



The eigenvalues of this matrix are zero and (5k+5ji)/Xt- Let- 
ting e be the unit eigenvector corresponding to the nonzero 
eigenvalue, it follows that 



85a 



06 



MSa 



—(6K+5^i)(e-Sot) < -^-(Sk+Sj)\Scx\. (A9) 
Xt A^ 



Inserting this result back into Equation (A6), we find that full 
linearization of the problem is self consistent when 



-J-((Sk + (57)< 1 
At 



(A10) 



Now that we have found a consistent way to estimate the as- 
trometric perturbation to the image position, we can estimate 
the change in macromodel and perturber convergences between 
the original, unperturbed image position and the new image po- 
sition. 

Consider first the macromodel. If E is the Einstein radius of 
the macromodel, one typically expects 8k/ 86 w k/0 e for the 
macromodel, and 5a w SkO for the deflection angle due to the 
perturber. Inserting these into Eq. (Al) and using 56 f=a MSa « 
XtSkO, we find that position perturbations are negligible as far 
as the macromodel is concerned provided 

0< x T o E <o.w E . (Aii) 

That is, position perturbations due to nearby perturbers are ex- 
pected to be negligible (relative to the convergence perturbation 
5k). The corresponding condition for the perturber is 

5k 

t-«1, (A12) 

AT 

a constraint that is already contained within the self-consistency 
requirement given by Eq. (A10). A similar computation for the 
shear yields the exact same conclusions. It follows that if the 
linearization condition, Eq. (A10), holds, we may neglect as- 
trometric perturbations when computing magnification pertur- 
bations. 



We emphasize that if an image lies very close to the tangen- 
tial critical curve, the condition expressed in Eq. (A 10) may 
represent a very stringent requirement. For order of magni- 
tude purposes, one can assume 5k ~ 5~f and Ar ~ 1, so that 
our requirement for neglecting astrometric perturbations can be 
roughly expressed as 

5/c+(57« (A13) 
In particular, this means that the maximum value that a pertur- 
bation 5k, or 6 j, can take while still producing negligible astro- 
metric perturbations is of order \fj,\ . It is possible to go even 
further and relate this to flux perturbations. We saw in Eq. (2) 
that for small perturbations 5k and £7, the flux perturbation is 
given by 



=2|m|{(1-«)*k+7*Ti}- 

I A* I 



(A14) 

1 /2, so that the relative 



One expects in general that k ~ 7 
flux perturbation 5[i/\fj,\ is of order 

^L^MtfK + Si). (A15) 

The linearization condition we derived, Equation (A13), states 
that the right-hand side of Eq. (A15) ought to be much smaller 
than one. Hence, another way of phrasing the linearization con- 
dition is simply 



5 (J, 

rr <<L 



(A16) 



That is, astrometric perturbations to the image position may be 
neglected when considering small flux perturbations. 

B. POINT MASS PERTURBERS 

Consider the perturbations due to a point mass of mass m. 
The perturbing potential, convergence, and shear are 

Sip(0) = b 2 ln(6), 5k(0) = O, and Sj(6)=^, (Bl) 

where b 2 = m/ / nT, c D\ j is the Einstein radius of the point mass. 
Inserting these expressions in Equations (4), we find only one 
root so that 



e 2 ( A) = ^lb 2 



and 



C(A) = (B2) 



for both A > and A < 0. Likewise, inserting the shear and 
convergence profiles in Equation 5, we find 

2*.(AW«-< (-,!£). (B3) 

Using the above expressions and carrying out the integrals in 
Equation (6), we find 



a(A ) = ^Mb 2 . 



(B4) 



This holds for both A > and A < 0; the cross sections for 
brightening and dimming an image are exactly equal to each 
other. This is to be expected: point mass perturbers couple only 
through the shear, so perturbers placed at angles <f> and <fi + ir/2 
create equal and opposite perturbations. Since no such symme- 
try exists for perturbers with nonzero convergence, the equality 
fi(A) = ct(-A) does not hold in general. This symmetry also 
implies that y\(5) = and hence the average magnification per- 
turbation due to point masses is zero 12 . However, the variance 

12 This is not correct when nonlinear perturbations are likely, in which case 
negative-parity images have a large probability of being strongly demagnified. 
Positive-parity images are also demagnified on average, but the effect is not as 
strong as for negative-parity images (Schechter & Wambsganss 2002). 
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FIG. B8. — The solid contours give the expected number of stars in a lens- 
ing galaxy which produce strong (nonlinear) magnification perturbations on an 
image. A linear cutoff Anl = 0.2 and a stellar mass fraction /* = 0.1 at the 
galaxy's Einstein radius are assumed. The lightly-shaded area represents the 
allowed region of parameter space for positive-parity images, and contours are 
for 1,3, and 5 nonlinear stars. We conclude that stellar magnification perturba- 
tions to multiply-imaged quasars will generally be large and nonlinear (hence 
microlensing). The dashed contours illustrate the number of mildly nonlinear 
(5 > 20%) SIS perturbers predicted by an SIS model assuming that the number 
density of perturbers is equal to that assumed in § 4.4 for pseudo-Jaffe profiles. 
This is to be compared to the dashed lines in Fig. 7. The number of nonlinear 
SIS perturbers is of order unity, whereas the number of nonlinear perturbers 
for the truncated profiles is < 0. 1 . This reflects the fact that tidal truncation 
occurs within the region <5 > 20% for an SIS perturber. We conclude that SIS 
profiles lead to strong overestimates of the importance of nonlinear perturbers. 



is nonzero. Inserting Eq. (B4) into Eq. (14), and using the defi- 

m, we obtain 
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where we have defined the perturber mass fraction / SU b, such 
that / su bKS c = J dmjgm and (N\ A) is the expected number of 
perturbers that produce a perturbation stronger than A. Inter- 
estingly, the variance of the perturbation does not depend in any 
way on the perturber mass spectrum. 

Lensing by stars: As a particular application, consider mag- 
nification perturbations due to stars in the lens galaxy. We take 
all stars to be identical in mass and define /* as the surface mass 
fraction of stars at the image position, /* = £*/£;r of . The ex- 
pected number of stars in the region |<5| > A (note the absolute 
value) is thus 



(N\\d\>A) = ^D 2 h {a(A) + a(-A)}: 

m* 



/* (B7) 



For /* > 0.1, k w 7 w 1/2, and fi w 10, we get (N\\5\ > 
1) > 0.3. We expect then, that nonlinear perturbers will play an 
important role in determining the observed magnification of an 
image, a result that has been amply demonstrated in previous 
studies (see for example Schechter & Wambsganss 2002) 

C. SINGULAR ISOTHERMAL SPHERE PERTURBERS 

Consider perturbations by a singular isothermal sphere. The 
perturbing potential, convergence, and shear profiles are 



5k ■■ 
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and Sj = 
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where b is the Einstein radius of the perturber, given by 

'cr v \2D LS 
c) D S ' 

and o> is the velocity dispersion of the perturber. From Equa- 
tion (4), the nonzero solutions for 6> max are 

if A > and A« > 
if A < and A r < 0. 

(C3) 

max = for A > 0, A + < and for A < 0, A_ > 0, and mia = 
always. Note 6* max (A) = implies that perturbations of size A 
are never realized, so it is impossible for SIS perturbers to either 
dim a positive-parity image or brighten a doubly negative-parity 
image. Finally, from Equation (5), we find 

2 ( /> ffl ((?|A) = cos- 1 f — J^?A 
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Inserting our above expressions into Equation (6), and defin 
ing the amplitude ctq via 
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we obtain 



• A > 0, A_ > 0, A + > : Brightening of a positive-parity 
image. 

a = a [2(l- K ) 2 + 7 2 ] 

• A > 0, A- < 0, A+ > : Dimming of a negative-parity 
image. 13 

• A < 0, A_ < 0, A + > : Brightening of a negative-parity 
image. 
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• A < 0, A- < 0, A+ < 
parity image. 

a = a Q [2(1 



Dimming of a doubly negative- 



The expressions above are in exact agreement with the re- 
sults from Keeton (2003) in the limit | A| <C 1. In particular, the 
cross sections in Keeton (2003) contain terms that are higher or- 
der in A, not found in our expressions. However, as we already 
noted, these corrections are important only when astrometric 
perturbations become non-negligible. Since negligible astro- 
metric perturbations were also assumed in Keeton (2003), these 
higher-order terms are important only when both formalisms 
break down. 

Before moving on, it is worth noticing that while the cross 
sections we have derived depend on the parity of the image, 
this dependence is something of happenstance. In particular, 
what differentiates the various expressions for the magnifica- 
tion perturbation cross sections is not the image parity, but 
whether Equation (4) has zero, one, or two roots. For an SIS 

13 Even though the result we quote here looks quite different from the one 
presented in Equation 17 of Keeton (2003), it is simple to show that -K — 29g = 

cos -1 {— 1 1 ^ j , so that both expressions for cr(A) do indeed agree to leading 

order in 8 (8$ is defined in Keeton (2003)). A similar argument holds for the 
case A < 0, Ar < 0. X R > 0. 
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perturber, these conditions coincide with the parity of the im- 
age, but this is not true in general. Indeed, we already showed 
in Appendix B, that point masses do not distinguish between 
positive- and negative-parity images. 

Consider an ensemble of SIS perturbers, which we choose 
to parameterize through their Einstein radii b. Assuming the 
macroimage has positive parity, Equation (13) becomes 

/ds 
db—b 2 [l/5 m -l/A], (C6) 

which clearly diverges in the limit 8 m — > 0. The physical reason 
behind this is clear: each individual perturber within a distance 
9 from the image contributes a positive perturbation 5^1/9. 
Since the number of perturbers in such a ring scales as 9, the 
net perturbation is divergent. 

How generic are these divergences? If the magnification per- 
turbation is dominated by the convergence perturbation 5 k, then 
perturbers along a ring will give nonzero average perturbations 
that scale as 96k. Convergence is expected then if, and only if, 
5k falls faster than 1 /9 2 . Since any such perturber has a finite 
mass, we conclude that finite magnification perturbations are 
expected if and only if the perturbers have finite masses. 
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